Robust sound localization using multi-source audiovisual information fusion
نویسندگان
چکیده
This paper illustrates the synergic advantages of a multi-modal sound localization system utilizing 2 cameras and a 3-element microphone array. The 2 cameras were used as part of a stereo feature-detection based visual object localization system, while the microphones were combined to produce a sound localization system incorporating a Temporal Power Fusion (TPF) algorithm. The cameras and microphones were integrated using spatial likelihood functions (SLFs), which greatly simplifies the integration process. Test results show a significant improvement in the integrated vision and sound localization (IVSL) system’s ability over that of the stand-alone microphone-array based sound localization system to accurately localize sound sourcess in low signal to noise situations. The IVSL system maintained an average error of 15cm at signal-to-noise ratios as low as 0.5 dB.
منابع مشابه
The impact of wind-generated bubble layer on matched field sound source localization in shallow water (Research Article)
This paper investigates the effect of the wind-generated bubble layer on the underwater sound source localization in the Persian Gulf shallow-water environment through computer simulation and the matched field processing technique. An underwater sound source of 2-10 kHz located at depths of 10, 45, and 75 m was considered at a distance of 4 km from a linear vertical receiver array. The estimati...
متن کاملAudio Vision: Using Audio-Visual Synchrony to Locate Sounds
Psychophysical and physiological evidence shows that sound localization of acoustic signals is strongly influenced by their synchrony with visual signals. This effect, known as ventriloquism, is at work when sound coming from the side of a TV set feels as if it were coming from the mouth of the actors. The ventriloquism effect suggests that there is important information about sound location en...
متن کاملState estimation of meetings by information fusion using Bayesian network
In this paper, a method of structuring the multi-media recording of a small-sized meeting based on various information such as sound source localization, multiple-talk detection, and the detection of non-speech sound events, is proposed. The information from these detectors is fused by a Bayesian network to estimate the state of the meeting. Based on the estimated state, the recording of the me...
متن کاملAudiovisual Person Tracking with a Mobile Robot
Mobile service robots are recently gaining increased attention from industry as they are envisaged as a future market. Such robots need natural interaction capabilities to allow unexperienced users to make use of these robots in home and office environments. In order to enable the interaction between humans and a robot, the detection and tracking of persons in the vicinity of the robot is neces...
متن کاملSound Source Localization Using a Profile Fitting Method with Sound Reflectors
In a two-microphone approach, interchannel differences in time (ICTD) and interchannel differences in sound level (ICLD) have generally been used for sound source localization. But those cues are not effective for vertical localization in the median plane (direct front). For that purpose, spectral cues based on features of head-related transfer functions (HRTF) have been investigated, but they ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Information Fusion
دوره 2 شماره
صفحات -
تاریخ انتشار 2001